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1.1 Introduction 

This chapter^ concentrates on a few specific topics concerning the distri- 
bution of galaxies on scales from 0.1 to nearly 1000 Mpc . The main 
aim is to provide the reader with the information and tools to familiarize 
with a few basic questions: 1) What are the scaling laws followed by the 
clustering of luminous objects over almost four decades of scales; 2) How 
galaxy motions distort the observed maps in redshift space, and how we 
can correct and use them to our benefit; 3) Is the observed clustering of 
galaxies suggestive of a fractal Universe; and consequently, 4) Is our faith 
in the Cosmological Principle still well placed, i.e. do we see evidence for 
a homogeneous distribution of matter on the largest explorable scales, in 
terms of the correlation function and power spectrum of the distribution 
of luminous objects. For some of these questions we have a well-defined 
answer, but for some others the idea is to indicate the path along which 
there is still a good deal of exciting work to be done. 



1.2 The clustering of galaxies 

I believe most of the students attending this School are familiar with the 
beautiful cone diagrams showing the distribution of galaxies in what have 

^ Lectures delivered at the Graduate School in Contemporary Relativity and Gravita- 
tional Physics, Como, May 2000 
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been often called slices of the Universe. This has been made possible by the 
tremendous progress in the efficiency of redshift surveys, i.e. observational 
campaigns aimed at measuring the distance of large samples of galaxies 
through the cosmological redshift observed in their spectra. This is one 
of the very simple, yet fundamental pillars of observational cosmology: re- 
constructing the three-dimensional positions of galaxies in space to be able 



to study and characterise statistically their distribution. Figure 1.1 shows 
the current status of the ongoing 2dF survey and gives an idea of the state 
of the art, with ~ 130, 000 redshifts measured and a planned final number 
of 250,000 0. From this plot, the main features of the galaxy distribu- 
tion can be appreciated. One can easily recognize clusters, superclusters 
and voids, and get the feeling on how the galaxy distribution is extremely 
inhomogeneous to at least 50 /i^^Mpc (see for a more comprehensive 
review) . 

The inhomogeneity we clearly see in the galaxy distribution can be 
quantified at the simplest level by asking what is the excess probability 
over random to find a galaxy at a separation r from another one. This 
is one way by which one can define the two-point correlation function, 
certainly the most perused statistical estimator in Cosmology (see for a 
more detailed introduction). When we have a catalogue with only galaxy 
positions on the sky (and usually their magnitudes), however, the first 
quantity we can compute is the angular correlation function w(6). This 
is a projection of the spatial correlation function ^(r) along the redshift 
path covered by the sample. The relation between the angular and spatial 
functions is expressed for small angles by the Limber equation (see [Q and 
for definitions and details) 

/■oo /-oo , . 

w{0)^ J dvv'^(j)'^{v) J dui [y/u^+v^e^j , (1.1) 

where </>(«) is the radial selection function of the two-dimensional catalogue, 
that in this version gives the comoving density of objects at a given distance 
V (which depends, for example, on the magnitude limit of the catalogue and 
the specific luminosity function of the type of galaxies one is studying). 
For optically-selected galaxies [||, Q wiO) is well described by a power-law 
shape cx 9~^-^ , corresponding to a spatial correlation function (r/ro)'' , with 
To — 5h~^ Mpc and 7 ~ —1.8, and a break with a rapid decline to zero 
around scales corresponding to r ~ 30 Mpc . 

The advantage of angular catalogues still at present is the large num- 
ber of galaxies they include, up to a few millions Since the beginning of 
the eighties (e.g. [^), redshift surveys allowed us to compute directly ^(r) 
in three-dimensional space, and the most recent samples have pushed these 
estimates to separations of ~ 100 Mpc (e.g. |^). Figure |l.2| shows the 
two-point correlation function in redshift space^, indicated as S,{s), for a 

^ This means that distances are computed from the red-shift in the galaxy spectrum, 




Figure 1.1. The distribution of tlie neariy 140,000 galaxies observed so far 
(September 2000) in the 2dF survey (from [^): compare this picture to that in 
to see how rapidly this survey is progressing towards its goal of 250,000 redshifts 
measured (note that this is a projection over a variable depth in declination, due 
to the survey being still incomplete). 
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representative set of published redshift surveys |l^, |n|, |T2j. In addition, 
the dotted hues show the real-space ^(r) obtained through de-projection 
of the angular w{9) from the APM galaxy catalogue jl^. The two differ- 
ent lines correspond to two different assumptions about galaxy clustering 
evolution, which has to be taken into account in the de-projection, given 
the depth of the APM survey. This illustrates some of the uncertainties 
inherent in the use of the angular function. As can be seen from the Figure, 
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Figure 1.2. The two-point correlation function of galaxies, as measured from a 
few representative optically-selected surveys (from Q). The plot shows results 
from the ESP §, LCRS APM-Stromlo, Q and Durham- UKST [|| surveys, 
plus the real-space ^(r) de-projected from the angular correlation function ■w{9) 
of the APM survey 111. 



the shape of ^(s) below 5 — 10/i^^Mpc is reasonably well described by 
a power law, but for the four redshift samples the slope is shallower than 
the canonical ~ — 1.8 nicely followed by the APM £,{r). This is due to the 
redshift-space smearing of structures that suppresses the true clustering 
power on small scales, as we shall discuss in the following section. Note 
how f (s) maintains a low-amplitude, positive value out to separations of 
more than 50ft~^Mpc , showing explicitly why large-size galaxy surveys 



neglect 
flow (§ 



ng the Doppler contribution by its peculiar velocity which adds to the Hubble 
1.3) 
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are important: we need large volumes and good statistics to be able to 
extract such a weak clustering signal from the noise. Finally, the careful 
reader might have noticed a small but significant positive change in the 
slope of the APM ^(r) (the only one for which we can see the undistorted 
real-space clustering at small separations), around r ~ 3 — 4 Mpc . On 
scales larger than this, all data show a "shoulder" before breaking down. 
This inflection point appears around the scales where ^ ~ 1, thus suggest- 
ing a relationship with the transition from the linear regime (where each 
mode of the power spectrum grows by the same amount and the shape is 
preserved), to fully nonlinear clustering on smaller scales [0. We shall 



come back to this in 51.4 



1.3 Our distorted view of the galaxy distribution 

We just had an explicit example of how unveiling the true scaling laws 
describing galaxy clustering from redshift surveys is complicated by the ef- 
fects of galaxy peculiar velocities. Separations between galaxies - indicated 
as s to remark this very point - are not measured in real 3D space, but in 
redshift space: what we actually measure when we take the redshift of a 
galaxy is the quantity cz = cztruc + Vpcc/ / , where Wpoc/ / is the component of 
the galaxy peculiar velocity along the line of sight. This quantity, while be- 
ing typically ^ 100 km s^^ for "field" galaxies, can rise above 1000 km s^^ 



in rich clusters of galaxies. As explicitly visible in Figure |l.2| , the resulting 
^(s) is flatter than its real-space counterpart. This is the result of two 
concurrent effects: on small scales, clustering is suppressed by high veloc- 
ities in clusters of galaxies, that spread close pairs along the line of sight 
producing in redshift maps what are sometimes called "Fingers of God" . 



Many of these are recognisable in Figure 1.1 as thin radial structures, par- 
ticularly in the denser part of the upper cone. The net effect on ^(s) is 
in fact to suppress its amplitude below '--^ 1 — 2/i^^Mpc . On the other 
hand, on larger scales where motions are still coherent, streaming flows 
towards higher-density structures enhance their apparent contrast when 
they appear to lie perpendicularly to the line of sight. This, on the con- 
trary, amplifies ^(s) above 10 — 20ft.~^Mpc . Both effects can be better 
appreciated with the help of a computer N-body simulation, for which we 
have the leisure to see both a real- and a redshift-space snapshot, as in 



Figure 1.3 



How could we recover the correlation function of the undistorted spa- 
tial pattern, i.e. £,{r)l This can be accomplished by computing the two- 
dimensional correlation function £,{rp,Tr), where the radial separation s of 
a galaxy pair is split into two components, tt, parallel to the line of sight, 
and Tp, perpendicular to it, defined as follows |]l5j. If di and d2 are the 
distances to the two objects (properly computed) and we define the line of 



4i ^ 
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Figure 1.3. Particle distribution from a 1-degree thick mock survey through a 
large-size Tilted-CDM n-body simulation in real (top) and redshift space (bot- 
tom) . The appearance of the two diagrams gives a clear visual impression of the 
effect of redshift-space distortions (note that here, unlike in the real survey of 
Figure no apparent luminosity selection is applied, i.e. the sample is voJume 
limited) . 
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Figure 1.4. The typical appearance of the bidimensional correlation function 
^(r-p,7r), in this specific case computed for the ESP survey [^. Note the elon- 
gation of the contours along the n direction for small values of Vp, produced 
by high-velocity pairs in clusters. The dashed circles show contours of equal 
correlation in the absence of distortions. 



sight vector 1 = (di + d2)/2 and the redshift difference vector s = di — d2, 
then one defines 

7r=^ TpEES-S-TT^. (1.2) 



111 



The resulting correlation function is a bidimensional m ap, whose contours 
at constant correlation look as in the example of Figure L4. By projecting 
^(rp,7r) along the tt direction, we obtain a function that is independent of 
the distortion. 



Wp{rp) 



2 / dy^R 



(1.3) 



and is directly related to the real-space correlation function (here indicated 
with £,R{r) for clarity), as shown. Modelling £,R{r) as a power law, £,R{r) — 
{r/ro)"'^ we can carry out the integral analytically, yielding 



Wp{rp) 



ro 



7r(i)r(2^) 



r(i) 



(1.4) 
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where T is the Gamma function. Such form can then be fitted to the 
observed Wp{rp) to recover the parameters describing ^(r) (e.g. p6j). Al- 



ternatively, one can perform a formal Abel inversion of Wp{rp) |17 

So far, we have treated redshift-space distortions nearly as only an 
annoying feature that prevents the true distribution of galaxies to be di- 
rectly seen. In fact, being a dynamical effect they carry precious direct 
information on the distribution of mass, independently from the distribu- 
tion of luminous matter. This information can be extracted, in particular 
by measuring the value of the pairwise velocity dispersion ai2{r). This 
is in practice a measure of the small-scale "temperature" of the galaxy 
soup, i.e. the amount of kinetic energy produced by the differences in the 
potential energy created by density fluctuations. Thus, finally, a measure 
of the mass variance on small scales. 

^(rp,7r) can be modelled as the convolution of the real-space corre- 
lation function with the distribution function of pairwise velocities along 
the line of sight |l8|. Let F{w,r) be the distribution function of the 
vectorial velocity differences w = U2 — Ui for pairs of galaxies separated 
by a distance r (so a function of four variables, wi, W2, W3, r). Let ^3 be 
the component of w along the direction of the line of sight (that defined 
by 1); we can then consider the corresponding distribution function of ws, 

f{w3,r)^Jdwidw2F{w,r) , (1.5) 

It is this distribution function that is convolved with ^(r) to produce the 
observed ^(r^, tt). If we now call y the component of the separation r along 
the line of sight, with our convention we have that = Ho{t: — y) and the 
convolution 

l + C(^P,'r) = [l + e(r)]®/(u;3,r) , (1.6) 
can be expressed as 

/+00 
dy{l+i {rl+y^t- } f[Ho{ii-y)] . (1.7) 
-oc 

Note that this expression gives essentially a model description of the ejfect 
produced by peculiar motions on the observed correlations, but does not 
take into account the intimate relation between the mass density distribu- 
tion and the velocity field which is in fact a product of mass correlations 
(see and and references therein). Within this model, therefore, 
we have no specific physical reason for choosing one or another form for 
the distribution function /. Peebles ||2^ first showed that an exponential 
distribution best fits the observed data, a result subsequently confirmed by 
N-body models |2^. According to this choice, / can then be parameterised 
as 



V2ai2(r) 



-V2 



0-12 (?') 



(1.8) 
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where {w3{r)) and ai2{r) are respectively the first and second moment of 
/. The projected mean streaming {w3{r)) is usually explicitly expressed 
in terms of Vi2{r), the first moment of the distribution F defined above, 
i.e. the mean relative velocity of galaxy pairs with separation r, {w3{r)) = 
y vi2{r) /r. The final expression for / becomes therefore 



V2(7i2(r) 



exp ■ 



TT-y 



1 + 



Hot 



0-12 (r) 



(1.9) 



(see e.g. |Tq| and flfl] for more details). 




Figure 1.5. The relative effect of the mean streaming v^ir) and pairwise ve- 
locity dispersion a\2{j) on the shape of the contours of £,{r-p,-n), seen through 



the model of eq. 1.7, While a high pairwise dispersion, cri2 = 700 kms^^ inde- 
pendent on scale is assumed (a reasonable approximation) , the two cases of zero 
mean streaming [F — 0) and stable clustering {F = 1) are considered in the infall 
model of Davis & Peebles IBI. Here the effect of the coherent motions is more 



evident than in the data plot of Figure 1.4: the contours of ^{rp,Tv) are clearly 



compressed along the n direction. This compression is a measure of flj^f /b. 

The practical esti mate of ai2 (r) is typically performed on the data by 



fitting the model of eq. 1/7 to a cut at fixed of the observed £,{rp, tt). To 
do this, one has first to estimate ^(r) from the projected function Wp{rp) 
and choose a model for the mean streaming wi2(0j e.g. that based on 
the similarity solution of the BBGKY equations jg]: 

vi2{r) = -Hor ^— ^ . (1.10) 

The traditional approach considers two extreme cases, corresponding to 
the somewhat idealised situations oi stable clustering {F = 1, a mean infall 
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streaming that compensates exactly the Hubble flow, such that clusters 
are stable in physical coordinates) and free expansion with the Hubble flow 
(i^ = 0, no mean peculiar streaming). It is instructive to see explicitly 
what happens to the contours of Cl'^p?"') iii these two limiting cases. In 
Figure |l]^ 1 have used equations 1.7, and 1.10| to plot the model for 



^(fpjTr), keeping ai2{r) fixed and varying the amplitude F of the mean 
streaming. Here the two competing dynamical effects (small-scale stretch- 
ing and large-scale compression) are clearly evident. The observational 
results yield values of ai2 at small separations around 300 — 400kms~^, 
with a mild dependence on scale jl^, |3[ |l^. This value has been shown 
to be rather sensitive to the survey volume, because of the strong weight 
the technique puts on galaxy pairs in clusters and the fluctuations 
in the number of clusters due to their clustering. A different method has 
been proposed more recently by Landy and collaborators |24| to allevi- 
ate this problem. The method is very elegant, and reduces the weight of 
high-velocity pairs in clusters by working in the Fourier domain where in 
addition the convolution of the two functions becomes a simple product 
of their transforms. A direct application to data and n-body simulations 
under particularly severe survey conditions seems however to give results 
which are not significantly dissimilar to the standard method |^ . 

Rather than assuming a model for the mean streaming vi2{r), one 
could measure it directly from the compression of the contours of C(?'p,7r), 
i.e. doing a simultaneous fit to the first and second moment. This quantity 
also carries important cosmological information, being directly proportional 
to the parameter f3 = il'](.^/b, where flM is the matter density parameter 
and b is the bias parameter of the class of galaxies one is using (see Peacock, 
this volume). This has been done, e.g. on the IRAS 1.2Jy survey Q, but 
the uncertainty on P is very large due to the weak signal and the need 
to simultaneously fit both the first and second moments. The situation 
in this respect will soon improve dramatically thanks to the ongoing 2dF 
§ and Sloan (SDSS) surveys that will provide 250,000 and 1,000,000 
redshifts respectively. 



1.4 Is the Universe fractal? 

The observation of a power-law shape for the two-point correlation func- 
tion together with the self-similar aspect of galaxy maps as that of Fig- 
ure 1.1, suggested several years ago a possible description of the large-scale 
structure of the Universe in terms of fractal objects . A fractal Universe 
without a cross-over to a homogeneous distribution would imply abandon- 
ing the Cosmological Principle. Also, under such conditions most of our 
standard statistical descriptions of large-scale structure would be inap- 
propriate [p8[: no mean density could be defined, and as a consequence 
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the whole concept of density fluctuations (with respect to a mean density) 
would make little sense. 

It is therefore of significant interest: 1) to compare the scaling proper- 
ties of galaxy clustering to those expected for a fractal distribution (keeping 
in mind that on different scales there are different effects at work, as we 
have seen in the previous section); 2) to put under serious scrutiny the 
observational evidences for a convergence of statistical measures to a ho- 
mogeneus distribution within the boundaries of current samples. Attempts 
to address these questions using redshift survey data during the last ten 
years or so have come to different conclusions, mostly because of disagree- 
ment on which data can be used and how they should be treated and 
analysed |3^ . It is because of the relevance of the issues raised that 

this subject has been the focus of an intense debate, as also demonstrated 
by the discussions at this School (see also Montuori, this book). 

1.4.1 Scaling lavi^s 

Let us review the arguments for and against the fractal interpretation of 
the clustering data, by first recalling the basic relations involved. 

A fractal set is characterized by a specific scaling relation, essentially 
describing the way the set fills the ambient space. This scaling law can be 
by itself taken as an heuristic definition of fractal (although it is not strictly 
equivalent to the formal definition in terms of Hausdorff dimensions, see 
e.g. ||3^ ): the number of objects counted in spheres of radius r around a 
randomly chosen object in the set must scale as 

N{r)(xr^ , (1.11) 

where D is the fractal dimension (or more correctly, the fractal correlation 
dimension). . Analogously, the density within the same sphere will scale 
as 

n{r)(xr"-^ . (1.12) 

Similarly, the expectation value of the density measured within shells of 
width dr at separation r from an object in the set, the conditional density 
r(r) will scale in the same way, 

T{r)^A-r^-^ , (1.13) 

with A being constant for a given fractal set. T(r) can be directly connected 
to the standard two-point correlation function ^(r): suppose for a moment 
that we can define a mean density (n) for this sample (we shall see in a 
moment what this implies), then it is easy to show that 

1 + ^(r) = £M oc (^ ;l4) 
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Therefore, if galaxies are distributed as a fractal, a plot of 1 +^(r) will have 
a power-law shape, and in the strong clustering regime (where ^(r) ^ 1) 
this will also be true for the correlation function itself. This demonstrates 
the classic argument (see e.g. ||^), that a power-law galaxy correlation 
function as observed ^(r) = (r/ro )"'*', is consistent with a scale-free, fractal 
clustering with dimension D — (although it does not necessarily imply 
it: fractals are not the only way to produce power-law correlation functions, 
see |3^). Note, however, that when £,{r) '--^ 1 or smaller, only a plot of r(r) 
or 1 + ^(r), and not ^(r), could properly detect a fractal scaling, if present. 

When this happens over a range of scales which is significant with 
respect to the sample size, the mean density (n) becomes an ill-defined 
quantity which depends on the sample size itself. Considering a spherical 
sample with radius Rs and the case of a pure fractal for simplicity, the 



mean density is the integral of eq. 1.12 



M = ^-i?f-^ , (1.15) 

and is therefore a function of the sample radius Rg. Under the same con- 
ditions, the two-point correlation function becomes 



3 \R 



with a correlation length 



/ 6 \ 

ro= l^-j -Rs , (1.17) 

which also depends on the sample size. Therefore, if the galaxy distribution 
has a fractal character, with a well-defined dimension D one should observe 
that: i) The number of objects within volumes of increasing radius N{R) 
grows as R^; ii) Analogously, the function T{r), or equivalently 1 +S,{r), 
is a power law with slope I? — 3; iii) The correlation length Tq is a linear 
function of the sample size. If the fractal distribution extends only up 
to a certain scale, the transition to homogeneity would show up first as 
a flattening of 1 + ^(r) and (less rapidly, given that they depend on an 
integral over r) as a growth N{r) cx and a convergency of Vq to a stable 
value. 



1.4.2 Observational evidences 

Pietronero ||2^ made originally the very important point that the use of 
^(r) was not fully justified, given the size (with respect to the clustering 
scales involved) of the samples available at the time, and the consequent 
uncertainty on the value of the mean density. In reality, this warning was 
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already clear in the original prescription g: one should be confident to 
have a fair sample of the Universe before drawing far-reaching conclusions 
from the correlation function. As it often happens, due to the scarcity of 
data the recommendation was not followed too strictly (see for more 
discussion on this point). 
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Figure 1.6. The function 1 + ^(s) for the same surveys of Figure 1.2. A stable 
power-law scaling would indicate a fractal range. It is clear how peculiar motions, 
that affect all data plotted but the APM ^(r) which is computed in projection, 
do distort significantly the overall shape. What would seem to be an almost 
regular scaling range with D ~ 2 from 0.3 to 30 Mpc , hides in reality a more 
complex structure, with a clear inflection around Sh"^ Mpc , which is revealed 
only when redshift-space effects are eliminated. 

Although the data available today have increased by an order of mag- 
nitude at least, the debate on the scaling properties and homogeneity of the 
Universe is still lively. Given the subject of these lectures and the extensive 
use we have made so far of correlation functions, I shall concentrate here on 
the evidences concerning points ii) and iii) in the summary list above. In 
Fig ure |l.6| , I have plotted the function 1 -|-^(s) for the same surveys of Fig- 



1.2 



Taken at face value, the figure shows that the redshift survey data 
can be reasonably fitted by a single power law only out to ~ 5/i~^Mpc . 
However, as soon as we compare these to the real-space 1 + £^{r) from the 
APM survey, we realize that what we are seeing here is dominated by the 
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redshift-space distortions. In other words, a fractal dimension on small 
scales can only be measured from angular or projected correlations, and if 
the data are interpreted in this way, it is in fact close to I? ~ 1.2. Above 
~ 5/i~^Mpc , a second range follows where D varies between 2 and 3, 
when moving out to scales approaching 100/i~^Mpc . The range between 
5h~^ Mpc and ^ 30 Mpc can in principle be described fairly well by a 
fractal dimension ~ 2, as originally found in |Q, a dimension that could 
perhaps be topological rather than fractal, reflecting a possible sheet-like 
organisation of structures in this range . Above 100 Mpc the func- 
tion 1 + ^(r) seems to be fairly flat, indicating a possible convergence to 
homogeneity. However, once this is established, this kind of plot does not 
allow one to evidence clustering signals of the order of 1%, which can only 
be seen when the contrast with respect to the mean is plotted, i.e. ^(s). For 
a similar analysis and more details, see the pedagogical paper by Martinez 

il- 

Another way of reading the same statistics and on which I would like to 
give an update with respect to |^ is the scaling of the correlation length ro 
with the sample size. It is known that for too small samples there is indeed 
a growth of Vo with the sample size (see e.g. early results in This is 

naturally expected: galaxies are indeed clustered with a power-law corre- 
lation function, and inevitably too small samples will tend statistically to 
overestimate the mean density, when measuring it in a local volume. When 
we consider modern samples, however, and we pay attention to not to com- 
pare apples with pears (galaxies with different morphology and/or different 
luminosity have different correlation properties, [^ ), then the situation is 



more reassuring: Table 1.1 represents an update of that presented in |31| 
and reports the general properties of the four redshift surveys I have used 
so far as examples. Being the survey volumes not spherical, here the "sam- 
ple radius" is defined as that of the maximum sphere contained within the 
survey boundaries (see |^). All these arc estimates of ro in real space. 
The observed correlation lengths are significantly different from the values 



Survey 


d 


Rs 


(predicted) 

1 o 


^ [observed] 


ESP 


~ 600 


5 


1.7 


y:.OU_g 25 


Durham/UKST 


~ 200 


30 


10 


4.6 ± 0.2 


LCRS 


~ 400 


32 


11 


5.0 ±0.1 


Stromlo/APM 


~ 200 


83 


28 


5.1 ± 0.2 



Table 1.1. The behaviour of the correlation length ro for the surveys discussed 
in previous figures, compared to predictions of a = 2 model. All estimates of 
To are in reaJ space, d is the effective depth of the surveys, while the "sample 
radius" Rs has been computed as in js^ . All measures of distance are expressed 
in h'^ Mpc . 
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predicted by the simple D — 2 fractal model. The result would be even 



worse using D = 1.2. The bare evidence from Table 1.1 is that the mea- 
sured values of Tq are remarkably stable, despite significant changes in the 
survey volumes and shapes. 

The counter-arguments in favour of a fractal interpretation of the 
available data are instead summarised in the chapter by M. Montuori. 
As the students can check, the main points of disagreement are related 
to a) the use of some samples whose incompleteness is very difficult to 
assess (as e.g. heterogeneous compilations of data from the literature), 
and b) the estimators used for computing the correlation function and the 
way they take the survey shapes into account. Also on these issues, the 
2dF and SDSS surveys will provide data sets to fully clear the scene. In 
fact, preliminary estimates of the correlation function from the 2dF survey 
provide a result in good agreement with the analyses shown here [|l| . 



1.4.3 Scaling in Fourier space 

It is of interest to spend a few words on the complementary, very important 
view of clustering in Fourier space. The Fourier transform of the correlation 
function is the power spectrum P{k) 

P{k)=iTT r ar)^^^r^dr , (1.18) 
Jo kr 

which describes the distribution of power among different wavevectors or 
modes k = 27r/A once we decompose the fluctuation field 5 = 5p/ p over 
the Fourier basis Q. The amount of information contained in P[k) is 
thus formally the same yielded by the correlation function, although their 
estimates are differently affected by the uncertainties in the data (e.g. 
^). One practical benefit of the description of clustering in Fourier space 
through P{k) is that for fluctuations of very long spatial wavelength (A > 
100ft.~^Mpc ), where ^(r) is dangerously close to zero and errors easily 
make the measured values fluctuate around it (see Figure |1.2D , P{k) is 
on the contrary very large. Around these scales, most models predict a 
maximum for the power spectrum, the fingerprint of the size of the horizon 
at the epoch of matter-radiation equivalence. More technical details on 
power spectra can be found in the chapter by J. Peacock in this same 
book. 



In Fi gur e 1.7, 1 have plotted the estimates of P{k) for the same surveys 
of Figure 1.2. Here again the projected estimate from the APM survey al- 
lows us to disentangle the distortions due to peculiar velocities, which have 
to be taken properly into account in the comparisons to cosmological mod- 
els. Here scales are reversed with respect to ^{r)^ and the effect manifests 
itself in the different slopes above ^ 0.3 h Mpc^^ : an increased slope in real 
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Figure 1.7. The power spectrum of galaxy clustering estimated from the same 
surveys of Figure 1.2 (also from power spectrum estimates from [ ^g] , 

pg]]). Also in Fourier space the differences between real- and redshift-space 
clustering are evident above k ~ 0.2 /iMpc~^ . 



space (dashed line) corresponds to a stronger damping by peculiar veloci- 
ties, diluting the apparent clustering observed in redshift space (all points). 
Below these strongly nonlinear scales, there is a good agreement between 
the slopes of the different samples (with the exception of the LCRS, see 
for discussion), with a well-defined power law range between ~ 0.08 
and ~ 0.3/iMpc^^ . The APM data show a slope ~ k~^-^, corresponding 
to the 7 ~ —1.8 range of ^(r), while at smaller fc's (larger scales) they 
steepen to ~ in agreement with the redshift-space points. It is this 
change in slope that produces the shoulder observed in ^(s) (cf. § |l.2| ). 
Peacock [|o| showed that such spectrum is consistent with a steep linear 
P{k) (~ fc~^'^), the same value originally suggested to explain the shoulder 
when first observed in earlier redshift surveys [p^ . A dynamical interpre- 
tation of this transition scale has been recently confirmed by a re-analysis 
of the APM data 

At even smaller fc's all spectra seem to show an indication for a turnover. 
However, when errors are checked in detail, they are at most consistent 
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with a flattening, with the Durham-UKST survey providing possibly the 
cleanest evidence for a maximum around k ~ 0.03/iMpc~^ or smaller. A 
flattening or a turnover to a positive slope would be an indication for a 
scale over which finally the variance is close to or smaller than that of a 
random (Poisson) process. But we learn by looking at older data that a 
turnover can also be an artifact produced when wavelengths comparable to 
the size of the samples are considered, and here we are close to that case. 

1.5 Do we really see homogeneity? Variance on ~1000 
Mpc scales 

Wu and collaborators Q and Lahav Q nicely reviewed the evidence for 
a convergence to homogeneity on large scales using several observational 
tests. On scales corresponding to spatial wavelengths A ~ 1000 Mpc , 
the constraints on the mean-square density fluctuations are provided essen- 
tially by the smoothness in the X-ray and microwave backgrounds. Mea- 
suring directly the clustering of luminous objects over such enormous vol- 
umes, is only now becoming feasible. The 2dF survey will get close to 
these scales. The SDSS will do even better through a sub-sample of 
early type galaxies selected as to reach a redshift z ~ 0.5. If the goal 
of a redshift survey is mapping density fluctuations on the larges possible 
scales a viable alternative to using single galaxies is represented by clusters 
of galaxies. Here I would like to discuss the properties of the largest of 
such surveys, that is in fact currently producing remarkable results on the 
amount of inhomogeneity on scales nearing 1000 Mpc . 

1.5.1 The REFLEX Cluster Survey 

With mean separations > 10 h^^ Mpc , clusters of galaxies are ideal objects 
for sampling efficiently long-wavelength fluctuations over large volumes 
of the Universe. Furthermore, fluctuations in the cluster distribution are 
amplified with respect to those in galaxies, i.e. they are biased tracers 
of large-scale structure: rich clusters form at the peaks of the large-scale 
density field, and their variance is amplified by a factor that depends on 
their mass, as it was first shown by Kaiser |44| . X-ray selected clusters 
have a further major advantage over galaxies or other luminous objects 
when used to trace and quantify clustering in the Universe: their X-ray 
emission, produced through thermal bremsstrahlung by the thin hot plasma 
permeating their potential well, is a good measure of their mass and this 
allows us to directly compare observations to the predictions of cosmological 
models (see for a review, and for a direct application). 

The REFLEX (ROSAT-ESO Flux Limited X-ray) cluster survey is 
the result of the most intensive effort for a homogeneous identification of 
clusters of galaxies in the ROSAT All Sky Survey (RASS). It combines 
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Figure 1.8. The spatial distribution of X-ray clusters in the REFLEX survey, 
out to 600 Mpc . Note how, despite the coarser mapping of large-scale 
structure, filamentary superclusters ("chains" of clusters) are clearly visible. 



a thorough analysis of the X-ray data , and extensive optical follow-up 
with ESO telescopes, to construct a complete flux-limited sample of about 
700 clusters with measured redshifts and X-ray luminosities The 
survey covers most of the southern celestial hemisphere {S < 2.5°), at 
galactic latitude > 20° to avoid high absorption and stellar crowding. 
The present, fully identified version of the REFLEX survey contains 452 
clusters and is more than 90% complete to a nominal flux limit of 3 x 
gj.g g-i ^^-2 ^jjj ROSAT band, 0.1-2.4 keV). Mean redshifts for 
virtually all these have been measured during a long observing campaign 
with ESO telescopes. Details on the identiflcation procedure and the survey 
properties can be found in , while earlier results are reported in |5l| . 
Figure 1.8 shows the spatial distribution of REFLEX clusters, evi- 
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Figure 1.9. Estimates of the power spectrum of X-ray clusters from flux-limited 
sub-samples of the the REFLEX survey, framed within Fourier boxes of 300 
(open squares), 400 (filled hexagons), and 500 (open hexagons) h~^ Mpc side, 
containing 133, 188 and 248 clusters, respectively. The two lines correspond to 
the best-fitting parameters using a phenomenological shape with two power laws 
(solid), or a A-CDM model, with Qm = 0.3 and = 0.7 (dashed) (from [Q). 




dencing a number of superstructures with sizes ~ 100 ft.^^ Mpc . One of 
the main motivations for this survey was to compute the power spectrum 
on extremely large scales, benefiting of the efficiency of cluster samples to 
cover very large volumes of the Universe. Figure 1.9 shows the estimates 
of P{k) from three subsamples of the survey (from^^). 

One of the strong advantages of working with X-ray selected clusters of 
galaxies is that connection to model predictions is far less ambiguous than 
with optically-selected clusters (e.g. ||5^, ^). We have therefore used the 
specific REFLEX selection function (converted essentially to a selection in 
mass), to determine that a Iow-Qm model (open or A-dominated), best 
matches both the shape and amplitude (i.e. bias value) of the observed 
power spectrum ji^ (dashed line in the figure). In fact, the samples shown 
here do not reach the maximum spatial wavelengths we can possibly sample 
with the current data, as the Fourier box could be made to be as large 
as 1000 h~^ Mpc (the survey reaches z = 0.3 with the most luminous 
objects). In such case, however, our control over systematic effects becomes 
poorer, and work is currently undergoing to pin errors down and understand 
how trustable are our results on ~ 1 Gpc scale, where we do see extra 
power coming up. At the very least, REFLEX is definitely showing more 
clustering power on very large scales than any galaxy redshift survey to 
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Figure 1.10. The two-point correlation function of the whole flux-limited RE- 
FLEX cluster catalogue (fiUed circles, |5^]), compared to that of ESP galaxies 
(open circles, The dashed lines show the Fourier transform of a phenomeno- 
logical flt to P{k) which tries to include the large-scale power seen from the 
largest subsamples (top line). The bottom curve is that obtained after scaling 
down by an arbitrary bias factor [b^ = (3.3)^ in this specific case). 



date. Similar hints for large-scale inhomogeneities seem to be suggested 
by the most recent analysis of Abell-ACO samples ]54[ |. 

For k > 0.05 /iMpc^^ , on the other hand, a comparison of REFLEX 
to galaxy power spectra shows a rather similar shape. This is probably 
better appreciated by lo oking at the two-point correlation function i^(s) 

compared in Figure 1.10 to that of the ESP galaxy redshift survey. 
The agreement in shape between galaxies and clusters is remarkable on 
all scales, with a break to zero around 60 — 70/i~^Mpc for both classes 
of objects. This is in general expected in a simple biasing scenario where 
clusters represent the high, rare peaks of the mass density distribution. 
This result strongly corroborates the simpler, reassuring view that at least 
above ~ 5/i~^Mpc the galaxy and mass distributions are linked by a 
simple constant bias. 
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1.5.2 "Peaks and Valleys" in the Power Spectrum 

Most of the discussion so far has been concentrating on the beauty of find- 
ing "smooth" simple shapes for ^(r) or P{k), as symptoms of an underlying 
order of Nature. Rather than being a demonstration of Nature inchnation 
for elegance, however, this smoothness and simplicity might simply indicate 
our ignorance and lack of data. In fact, while smooth power spectra are 
predicted in models dominated by non-interacting dark matter particles, 
as Cold Dark Matter, a very different situation is expected in cases where 
ordinary (baryonic) matter plays a more significant role, with wiggles ap- 
pearing in P{k) that would be difficult to detect with the size and "Fourier 
resolution" of our current data sets. 

The possibility that the power spectrum shows a sharp peak (or more 
peaks) around its maximum has been suggested a few times during the last 
few years. For example, Einasto and collaborators | |55[ | found evidence for 
a sharp peak around k ~ 0.05 h Mpc^^ in the power spectrum of an earlier 
sample of Abell clusters, a feature later confirmed with lower significance 
by a more conservative analysis of the same data [ p6[ . The position of this 
feature is remarkably close to the ^ 130/i^^Mpc "periodicity" revealed 
by Broadhurst and collaborators in a "pencil-beam" survey towards the 
galactic poles , and more recently, in an analysis of the redshift distri- 
bution of Lyman-break selected galaxies . Other evidences have been 
claimed from two-dimensional analyses of redshift "slices" |^^, or QSO 
superstructures jGOf . 

These observations have stimulated some interesting work on models 
with high baryonic content. In this case, the power spectrum can exhibit 
a detectable inprint from "acoustic" oscillations within the last scattering 
surface at z ^ 1000, the same features observed in the Cosmic Microwave 
Background (CMB) radiation (|^). While the most recent estimates of 
the REFLEX power spectrum do not show clear features around the scales 
of interest to justify "extreme" high-baryon models (contrary to early in- 
dications which shows the importance of the careful assessment of 
errors), the extra power below k ~ 0.02 could still be an indication of 
an higher-than-conventional baryon fraction |6^ , along the lines that 
seem to be suggested by the Boomerang CMB results . 

1.6 Conclusions 

At the end of these lectures, a student is possibly more confused than 
he/she was in the beginning, at least after a first read. I hope however 
that once the dust settles, a few important points emerge. First, that the 
processes which shaped the large-scale distribution of luminous objects we 
observe today are different at different scales. At small scales, we observe 
essentially the outcome of fully nonlinear gravitational evolution that re- 
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shaped the hnear power spectrum into a collection of virialised, or nearly 
so structures. Therefore, one cannot naively take the redshift survey data 
and look for specific patterns or statistical properties without taking into 
account galaxy peculiar motions. For this reason, one should be careful 
in over-interpreting things like a single power-law scaling from scales of a 
tenth of a Megaparsecs to hundred Megaparsecs, because, again, different 
phenomena are being compared. On the contrary, one can use these dis- 
tortions to really "see" how the true mass distribution is, and I have spent 
a considerable part of these lectures to describe some of the techniques in 
use. Moving to larger and larger scales, we enter a regime where we are 
lucky enough that we can still see something related to the original scaling 
law of fluctuations. This is what was originally produced by some genera- 
tor in the early Universe (inflation?) and processed through a matter (dark 
plus baryons) controlled amplifier. On even larger scales, we hope we are 
finally entering a regime where the variance in the mass is consistent with 
a homogeneous distribution, although we have seen that even the largest 
galaxy and cluster samples are barely sufficient to see hints of that, per- 
haps suggesting even more inhomogeneity than we expect. Does this mean 
that we are living in a pure fractal Universe? The scaling behaviour of 
galaxies and the stability of the correlation length seem to imply that this 
cannot be the case. On top of everything, the smoothness of the Cosmic 
Microwave Background (treated elsewhere in this book) is probably the 
most reassuring observation in this respect. What we seem to understand 
is that our samples have still difficulty to properly sample the very largest 
fluctuations of the density field, on scales where this is not fully Poissonian 
(or sub-Poissonian) yet. 

Finally, I hope the students get the message that despite the tremen- 
dous progress of the last 25 years which transformed Cosmology into a 
real science, we still have a number of fascinating questions to answer and 
still feel far away from convincing ourselves that we have understood the 
Universe. 
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